Dark Energy from Large-Scale Structure Lensing Information 
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Wide area Large-Scale Structure (LSS) surveys are planning to map a substantial fraction of the 
visible universe to quantify dark energy through Baryon Acoustic Oscillations (BAO). At increasing 
redshift, for example that probed by proposed 21-cm intensity mapping surveys, gravitational lensing 
potentially limits the fidelity (Hui et al., 2007) because it distorts the apparent matter distribution. 
In this paper we show that these distortions can be reconstructed, and actually used to map the 
distribution of intervening dark matter. The lensing information for sources at z = 1 — 3 allows 
accurate reconstruction of the gravitational potential on large scales, I < 100, which is well matched 
for Integrated Sachs- Wolfe (ISW) effect measurements of dark energy and its sound speed, and a 
strong constraint for modified gravity models of dark energy. 

We built an optimal quadratic lensing estimator for non-Gaussian sources, which is necessary for 
LSS. The phenomenon of "information saturation" (Rimes & Hamilton, 2005) saturates reconstruc- 
tion at mildly non-linear scales, where the linear source power spectrum A 2 ~ 0.2 — 0.5. We find 
that steeper power spectra, saturate more quickly. We compute the effective number densities of 
independent lensing sources for LSS lensing, and find that they increase rapidly with redshifts. For 
LSS/21-cm sources at z ~ 2 — 4, the lensing reconstruction is limited by cosmic variance at I < 100. 



I. INTRODUCTION 

The standard cosmological model has achieved sub- 
stantial quantitative success through precision cosmol- 
ogy. It requires a mysterious and dominant dark energy 
component, which is not physically understood. Or it 
may be a hint that Einstein's General Relativity is not 
the correct theory to describe our universe. 

New quantitative tests are needed to improve our un- 
derstanding of the nature of dark energy or modified 
gravity. In this paper we show how high-z Baryon Acous- 
tic Oscillations (BAO) surveys, in addition to measuring 
kinematic cosmology, can also constrain the dynamics 
of perturbations through weak gravitational lensing. A 
direct measurement of the gravitational potentials can 
determine the sound speed of the dark energy field [1] . 

Recently, modified gravity models have been studied 
quantitatively. In order to pass local precision grav- 
ity tests, and match the Cosmic Microwave Background 
(CMB) fluctuations, these models tend to differ in their 
predictions only on very large scales and at low redshifts. 

Direct all-sky lensing maps are observationally chal- 
lenging, since the signals are weak, and systematics are 
difficult to control. However, quadratic estimators of 
CMB maps have recently demonstrated its feasibility on 
very large scales [2]. Similar techniques have been pro- 
posed for lensing of diffuse structures [3], and will be ex- 
plored in further detail in this paper. 

The kinematic history of the universe is being mapped 
out using the standard metre stick of BAO. A range of 
surveys are underway and have been proposed to produce 
a coarse image of large-scale structure (LSS) over a sub- 
stantial sky area and redshift range. While these surveys 
are not directly suitable for lensing mapping, the maps 
they produce will be distorted by gravitational lensing 
[4] . Since the intrinsic correlations is known to be statis- 



tically isotropic, the lensing induced changes can be used 
to reconstruct the dark matter map, in analogy with lens- 
ing of the CMB. The CMB is a source screen with well 
known lensing properties, for which lensing and ISW re- 
sults in a non-zero three point function. As we show in 
this paper, the kernels are not perfectly matched, and 
the dynamical dark energy properties can be better mea- 
sured with sources at a range of redshifts, as is the case 
for 21cm LSS. This general approach is known as lensing 
tomography. 

Some of the proposed fast BAO surveys [5-7] are based 
on an intensity mapping approach, where the LSS is 
mapped without detection of individual galaxies over a 
large fraction of the sky. For such surveys, point source 
based lensing mappings are not applicable [8], and the 
LSS lensing is the only available tool. Lensing recon- 
struction using Gaussian random fields as sources is now 
well understood for both 2-D and 3-D structures [3, 9- 
13]. Metcalf and White [13] have explored the possibility 
of applying 21-cm lensing from low and high redshifts to 
the dark energy constraint. While their method is based 
on the Gaussian assumption, it was shown that the ne- 
glect of non-Gaussianity could generate orders of mag- 
nitude difference in the noise estimation [12] (hereafter 
LP08). 

In general, lensing reconstruction is a quadratic func- 
tion of the density field, which is quantified by its power 
spectrum. Rimes and Hamilton [14] (hereafter RH05) 
showed that the Fisher information available in the power 
spectrum saturates at trans-linear scales, and stays sev- 
eral orders of magnitude below the Gaussian Fisher in- 
formation content. In this paper, we combine the infor- 
mation from these approaches, and construct the optimal 
quadratic lensing estimators for non-linear sources fields 
from N-body simulations. 

We find that the reconstruction noise of lensing from 
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the simulated sources decreases with reduced experimen- 
tal noise at linear scales, saturates at quasi-linear scales, 
and drops again at highly non- linear scales. There is a 
plateau region at the trans-linear regime, which is analo- 
gous to what was found with the Fisher information con- 
tent in the initial amplitude of matter power spectrum by 
Rimes and Hamilton [14] . Both effects are caused by the 
non-Gaussianity (or non-linearity) introduced by gravita- 
tional clustering during structure formation. We found 
the saturation scale is a steadily increasing function of 
the shape of the linear power spectrum. 

This non-Gaussianity makes the 21-cm lensing less 
promising than it first appears. However, because of 
the abundance of 21-cm sources, the Signal-to-Noise ra- 
tio (S/N) of 21-cm lensing at high redshift z ~ 1 — 6 
is still competitive compared with other surveys. The 
effective number density of sources defined by the inde- 
pendent number of cells in the 21-cm source, increases 
quickly with redshift. At redshift 4 — 6, this number den- 
sity could be 9arcmin~ 2 . 

The paper is organized as follows: We overview the 
progress in the LSS information and lensing studies in 
section II. The optimal lensing estimator is introduced 
in section III. The numerical methods are presented in 
section IV. The results are discussed in section V. We 
conclude in section VI. 



II. INFORMATION AND LENSING FROM THE 

LSS 

The lensing of Gaussian random fields can be described 
in several ways. For the purposes of this paper, we are 
in a regime where the fluctuation scale of the sources is 
smaller than the structures of interest in the lenses. In 
this limit, lensing has two effects on an image: conver- 
gence (k) and shear (7). 

As we will find through detailed calculations, these 
two quantities carry a comparable amount of informa- 
tion, dependent on the slope of the power spectrum of 
sources. For pedagogical purposes, we first consider the 
convergence, that is the stretching of the image changes 
the amplitude of fluctuations at a fixed apparent angu- 
lar scale. By comparing the power spectrum in different 
patches, we can measure the relative convergence. 

The accuracy of this procedure is proportionate to the 
precision with which the power spectrum can be mea- 
sured. This problem has been studied in terms of Fisher 
Information by RH05, and their results directly trans- 
late into the accuracy of lensing reconstruction. After a 
technical calculation, we find that the effective number of 
independent lensing sources is equal to the RH05 Fisher 
Information, scaled by a coefficient of order unity, which 
depends on the slope of the power spectrum. 

A related problem is the impact of lensing on the sky 
averaged power spectrum [4, 15, 16]. There had been 
concern that the lensing at low redshift may impact the 
ability to accurately measure the matter power spectrum 



at high redshift. To first order, there is an equal number 
of positive and negative convergence patches, so the net 
effect is second order. Since we can measure the first 
order effect on each patch, we can actually correct its 
impact, and cancel off the second order bias. 



III. LENSING RECONSTRUCTION 

In the lensing of diffuse background, what is observed 
is the distorted brightness temperature distribution. As 
discussed in LP08, all lensing estimators are quadratic 
terms of temperature fields. The lensing field (conver- 
gence k, shear 7, or deflection angle d) can be recon- 
structed with the product of two smoothed temperature 
fields. For different estimators, the window functions 
could also be scalars or vectors. 

How to choose the smoothed window is just a matter 
of optimization. There are also progressive levels of op- 
timization to be considered: Is the source isotropic? Is 
the window function isotropic? For Gaussian distributed 
temperature field, the optimal window function can be 
solved analytically. Hu and Okamoto [17] formulated the 
optimal quadratic deflection angle estimator (OQDE) for 
2-D CMB lensing, and later Zahn and Zaldarriaga [10] 
generalized it to 3-D 21-cm lensing. The OQDE surpris- 
ingly has the same form at all scales, even though an 
additional scale dependent normalization factor needs to 
be applied at the last step. In an alternative approach, 
by optimizing the reconstruction noise in the zero-mode, 
we derived the optimal Gaussian window for the conver- 
gence and shear fields separately in LP08. 

In reality, 21-cm sources are not Gaussian fields. For 
simplicity, we use the 3-D dark matter distribution to 
represent the 21-cm emission at the same redshift. We 
investigated both OQDE and our estimators by numer- 
ical tests with simulated non-Gaussian source fields and 
mocked Gaussian ones. For Gaussian sources, we found 
the combined result of our two estimators has exactly 
the same S/N as the OQDE on all scales. This agrees 
with our expectation because of the scale independence 
of the optimal window functions. If one window function 
is optimal at one scale, it is also optimal at other scales. 
For non-Gaussian sources, however, the optimal window 
function derived from Gaussian assumption is not opti- 
mal any more. The lensing reconstruction noise could 
be underestimated by orders of magnitude if we use the 
results from Gaussian sources, which could be calculated 
analytically by applying Wick's theorem. By choosing 
appropriate weights, our combined estimator has better 
performance than the OQDE. 

In this paper, we will further explore the optimization 
of non-Gaussian sources. We will develop the optimal 
non-Gaussian estimator, which can only be constructed 
by numerical measurements from a large sample of sim- 
ulations. The reconstruction noise of optimal lensing es- 
timators for non-Gaussian 21-cm sources are closely re- 
lated to the Fisher information contained in non-linear 
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matter power spectrum described by RH05. They indi- 
cated that the Fisher information for the initial ampli- 
tude of matter power spectrum could be written as the 
sum of the inverse of covariance matrix of power spec- 
trum estimates, multiplied by the partial derivative of 
power spectrum with respect to the initial amplitude. 
They have revealed the cumulative Fisher information re- 
sults up to a maximum scale, which enters the calculation 
as the upper limit scale of the covariance matrix. They 
found a very interesting phenomenon: the Fisher infor- 
mation increases at linear scales, and then the growth 
becomes very slow at the quasi-linear region (which they 
label as information plateau), and the Fisher information 
starts to rise again in highly non-linear scales. 

In this context, Fisher information means how accu- 
rately the amplitude of the power spectrum can be mea- 
sured. As we will discuss later in section III A, lens- 
ing can be detected by the change of the source power 
spectrum. We will also prove that the optimal window 
function of non-Gaussian sources actually contains the 
inverse covariance matrix of power spectra. Thus the re- 
construction noise of lensing roughly estimates how much 
information is contained in the source power spectrum: 
the lower the noise level achieved, the more information 
is gained. Instrumental noises of 21-cm experiments can 
be approximately treated as hard cut-offs at some scales, 
then we could inspect the reconstruction noises with dif- 
ferent cut-off scales. When the sources are Gaussian, the 
covariance matrix is diagonal and could be expressed as 
a square term of the source power spectrum, and the op- 
timal window function reverts to the form we found in 
LP08. Our numerical results with Gaussian estimators 
had similar behavior as RH05: the reconstruction S/N 
increased first and reached a peak at trans-linear scales. 
At non-linear scales, the S/N dropped again. We will 
explain later, the decline of S/N is an artificial effect by 
using the non-optimal window function for non-Gaussian 
sources. In this paper, we will show that the optimal non- 
Gaussian estimator also leads to a plateau in trans-linear 
scales in the S/N plot, and like Fisher information of 
matter power spectrum amplitude, the curve falls again 
in non-linear scales. 

The covariance matrix of non-linear power spectrum 
is measured from an ensemble of N-body simulations. 
RH05 showed that the covariance of power spectrum can 
only be measured with at least hundreds of independent 
simulations. Therefore we generated around 100 or more 
different sources, depending on the redshift, by running 
the same number of N-body simulations. 



A. Optimal non-Gaussian estimator 

As we explained in LP08, for the lensing of a diffuse 
background, all unbiased estimators are two-point func- 
tions of the source temperature field with each points 
convolved with a window function, no matter which 
quantity is reconstructed: the convergence, shear, de- 



flection angle or the potential. 

E(x)=f wl {x)f w2 (x), 



(1) 



where T w i,f w2 are two convolved temperature fields by 
some window functions, e.g., 



f wl (x) = d 2 x'f(x')Wi(x-x'). 



(2) 



Here T is the lensed temperature field, and we use T to 
represent the intrinsic unlensed temperature field. We 
will keep using this convention throughout the paper: 
the variables with tilde mean the quantities with lensing, 
while those without tilde mean the original ones. To 
optimize the estimator, one minimizes its variance 



\E) = (fl 1 fl 2 ). 



(3) 



which is a four-point function of the temperature distri- 
bution. Except for a Gaussian distribution, four-point 
functions are in general not analytical, and can not be 
calculated from two-point statistics. 

In LP08, we derived optimal estimators of the conver- 
gence and shear for a Gaussian source distribution. Since 
the optimal window function for Gaussian sources (here- 
after optimal Gaussian estimator or optimal Gaussian 
window function) does not depend on the scale, we were 
able to look for its form in the limit of a slowly varying k 
(or 7). We then applied a maximum likelihood method 
to solve the optimal window. The calculation was done 
analytically assuming that the covariance matrix of the 
power spectrum is diagonal and can be obtained from the 
power spectrum, which is the implication of Wick's the- 
orem. The variance of optimal Gaussian estimators rises 
when the resolution of the observation is improved, which 
shows the optimal Gaussian estimators are far from op- 
timal for a non-Gaussian source distribution. 

To find the optimal estimator for non-Gaussian 
sources, we pursue the calculation in a slightly differ- 
ence approach. We adopt a minimum variance method, 
demonstrate it with the Gaussian case and generalize it 
to the non-Gaussian case. First, we solve the optimal 
estimator for a constant n and a Gaussian source, i.e., 
k(x) — k . We then generalize the estimator to the op- 
timal non-Gaussian estimator of varying k and 7. 

The lensed power spectrum from observations can be 
Taylor expanded in first order as: 



P tot (k) = P(k) + P N (k) 

« P(k) + P N (k) + 



dP(k) 



dn 



(4) 



K=0 



where the noise P N = P e + P s includes both the instru- 
mental noise, P e , and the statistical fluctuation of the 
power spectrum, P s , which is the sample variance for a 
Gaussian source. In LP08, we have defined the deriva- 
tive of the lensed power spectrum with respect to k, i.e., 
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G(k, fe) = 9P q^ ■ For convenience, we will use G to rep- 



resent its value at n = 0, therefore 



G(k) w 2P(fc) + AP(fc). 



(5) 



where AP 



P'k(k 2 ± /k 2 ), P'(k) = dP(k)/dk, k± 
means the component of fc in the transverse plane of 
the line of sight, as defined in LP08. Note that here 
P(k) is the expectation value of source power spec- 
trum estimate, and P(k) is the lensed power spec- 
trum of one realization. In the absence of lensing, the 
power spectrum in one realization is measured by an 
estimator P(fc), which is equal to the sum of P(k) 
and P N (k). We frequently use the approximation that 
((P tot (k) - P(k)) 2 ) » ((P(fc) - P(k)) 2 ) in the recon- 
struction noise estimation. To simplify the algebra, we 
first consider the case P e = 0, and P N = P s . It will 
later be straightforward to add the instrumental noise. 



1. Discrete case 

Following the convention of Bertschinger [18], we use 
subscript [...]*. to describe discrete quantities, and paren- 
thesis [...](&) to describe continuous quantities. For ex- 
ample, 



/oo 
d 3 xf ( 
-oo 

f k = Y,T{x j )e- i 



x)e 



(6) 



Note that the observed temperature field is from the 
contribution of lensed brightness temperature field and 
noise: T = Tf, + n. The expectation value of the source 
power spectrum is P k = (T bk T£ k ), and T b k is the dis- 
crete Fourier transform of Ti,(x). If the source brightness 
temperature follows a Gaussian distribution, the estima- 
tors of the unlensed power spectrum Pj~ at different k 
are independent, and their variance can be calculated as 
Var(P fc ) = (P^ 2 ) = (P% 2 ) = P£/n kb , where n kb is the 
number of modes in the k frequency bin. 

We first consider the discrete case with an indepen- 
dent set of Fourier frequencies k. For every frequency, 
an estimator of k can be constructed as: 



K 



Gk 



(7) 



where nk = P k /Gk- Note kk is not the Fourier trans- 
form of k(x), which has a constant value, but the mea- 
surement of kq at frequency k. The optimal estimator of 
Ko should be the total contribution from all k appropri- 
ately weighted. The minimum variance estimator of k 
is 



KO = 



£V*2 



(8) 



where cr| 



(n k n k ) = Pk/ n k b G k . Therefore, the weight 
at each k is inverse proportional to the a 2 ,. The recon- 
struction noise of the minimum variance estimator is now 



Var(/t ) 



1 



(9) 



More generally, in the non-Gaussian case, power spec- 
trum at different k are correlated. We can write all vari- 
ables in the form of matrices and vectors: 



Gk 



P P 



N 



(10) 



where G, P and P are the Nk x 1 matrix composed 
by Gk,Pk and P k respectively, and Nk is the number 
of all frequencies. The minimum variance (least square) 
estimator can be written as 



G*C _1 Gko = G*C- 1 (P tot P P N ) 



(11) 



where Ck,k> = (Pk Pk>) * s the covariance matrix for the 
power spectrum. Since the terms introduced by lensing 
do not dominate in the noise, we have neglected their 
contribution in the covariance matrix C. All noise related 
calculations in the following text will use P instead of 
P tot , while the estimator of lensing quantities will always 
use p tot . Therefore the minimum variance estimator for 
the non-Gaussian source is 

k = (G^^Gr^CT^P* * - P - P N ) . (12) 

and the variance of the estimator is 



Var(«o) - (G*C -1 G)- 



2. Continuous case 



(13) 



We now consider the continuous case. Assuming the 
source is a cube with physical length L in each dimension, 
by definition f(k) = f k L 3 , P(k) = P k L 3 ,G(k) = G k L 3 
and C(k,k') = Ck,k'L 6 - As such, taking the continuum 
limit leads to 



Kq = 



1 f d 3 k d 3 k' 



QJ (27r) 3 (27r) 3 
d 3 A- 

1 ' P tot (k)T(k)-V , 



C- 1 (k,k')G(k r )(P tot - P - P N )(k) 



(2n) 3 



(14) 



where 

Q 

V 



Uwr c ~ 1{k > k ' )G{k% (15) 



d 3 fc d 3 k' 



(2.) 3 (2.) 3C_1(fe ' fe ' )G(fe)G(fe,) ' (16) 

/c\ 3 k 
— (P(k)+P N (k))T(k). (17) 



V is the mean variance of the smoothed temperature 
field. Note that when the source is Gaussian, the op- 
timal filter is T(k) = G(k)/P{k) 2 /Q. This is the 
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estimator presented in LP08. Using T(k)T*(k) — 
(27r) 3 <5 3D (0) J P tot (fe) = L 3 P tot (k) and Parseval's theorem, 
where S 3B is the 3-D Dirac delta function, we can write 
the calculation in real space instead of Fourier space 

/c\ 3 k ~ 
—T(k)T*(k)f(k)-V 

= L~ 3 J d 3 xf wl (x)f w2 (x)-V 

= f wl (x)f w2 (x)-V, (18) 

where the two window functions to smooth the temper- 
ature field are the decomposition of T, i.e., W\Wi = T. 
The noise of k is 

(19) 

where C(fc, fe') = ((P(fc) - P(fc))(P(fc') - P(fc'))) is the 
covariance matrix of the power spectrum. Similar to Eq. 
(12) in the discrete case, we have neglected the contribu- 
tion from lensed terms in the estimate of the covariance 
matrix C. It is clear that the noise for k equals Q _1 . 
C(k,k') will be measured from simulations. 
Since 

P(fc±,fc||) = |Jr 1 P(J- 1 fe_ L ,fc||) 

w (l + 2K)[P(fc) + AP(fc)(K + 7 iCos20 fci 
+72 sin 20 fe J], (20) 

where 9 k± is the angle between and the transverse 
coordinate (LP08), the first order Taylor expansion gives 

G 71 (fe) = AP(k)cos26 k± , (21) 
G 72 (fc) = AP(k)sm29 k± . (22) 

We will rewrite the equation in spherical coordinates us- 
ing k = (k,6,(p), kj_/k 2 = sin 2 6, and 9 k± = (f>. 

IV. NUMERICAL METHODS 

We will calculate the reconstruction noise for sources 
at redshift: 1.25, 3 and 5. The power spectrum co- 
variance matrix will be measured from N-body sim- 
ulations. Unless mentioned explicitly, WMAP5 cos- 
mological parameters are used throughout the paper: 
Q m = 0.258, 0\ = 0.742, tl b = 0.0441, a 8 = 0.796, n s = 
0.963, h= 0.719, r= 0.087 [19]. 

A. Simulations 

We generate the N-body simulations with the 
CUBEPM code. CUBEPM is MPI paralleled particle- 
mesh (PM) code, and has particle-particle forces imple- 
mented at sub-grid scales. It is further parallelized by 
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FIG. 1: Power spectrum at z=0, 1.25, 3, and 5. The data points 
are from simulations, and the lines are the predicted non-linear 
power spectra generated by CAMB. The non-linearity increases 
with lower redshift. The power spectra from simulations at small 
scales drop due to the resolution limits. The parameters of the 
simulations are shown at tabic I. 



shared-memory OpenMP on each node. The simulation 
volume (which is also called simulation box) is cubically 
decomposed to n 3 sub-volumes, and the calculation of 
each sub- volume is performed on one node of cluster. 
The total number of nodes used in simulation is n 3 . The 
code has been run on up to 1000 nodes. We used n = 2 
and 3 in our simulations. The simulations are run on 
the SUNNYVALE cluster of Canadian Institute for The- 
oretical Astrophysics (CITA), which is a Beowulf cluster 
composed of 200 Dell PE1950 nodes. For each node there 
are 2 quad core Intel(R) Xeon(R) E5310 @ 1.60GHz pro- 
cessors, 4GB of RAM, and 2 gigE network interfaces. 

HI gas is distributed in galaxies. If we ignore the bias 
between the galaxies and dark matter distributions, we 
could approximately use dark matter to represent 21-cm 
sources distributions at these redshifts (the simple toy 
model in LP08) since HI sources are expected to trace 
dark matter fairly well. We have generated the source 
distribution with CUBEPM and we output the 3-D par- 
ticles distribution at redshifts 1.25, 3 and 5. To cor- 
rectly measure the covariance matrix of power spectrum, 
we generate 186 independent simulations at z — 1.25. 
Because the non-linear lity is less at higher redshift, we 
have run 100 simulations at z — 3 and 90 simulations 
at z = 5. The density field is produced by assigning 
the mass of particles to nearby grids with Cloud in Cell 
(CIC) method. 

As we will illustrate later, the 3-D Fisher information 
(as defined in RH05) of the amplitude of source power 
spectrum has similarity with the reconstruction noise. 
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redshift 


L{hr x M V c) 


A 2 (2tt/L) 




A 2 (k Ny ) 


k s 







300 


0.012 


512 


219 


0.14 


0.46 


1.25 


300 


0.0036 


512 


39.3 


0.28 


0.34 


3.0 


100 


0.015 


1024 


40.5 


0.47 


0.19 


5.0 


50 


0.020 


1728 


26.5 


0.99 


0.16 



TABLE I: Source simulation parameters. We choose these simulations to optimize the computation load while maintaining the required 
resolution. The simulation parameter choice is validated by looking at the convergence of the Fisher information content of the power 
spectrum in both linear and non-linear scales. We found the upper limit for the non-linear power spectrum at the fundamental modes, 
which has the same size as the box, is around 0.02; and the lower limit for non-linear power spectrum at the Nyquist frequency fcjvj/i is 
around 20. The values of non-linear power spectra are estimated from CAMB. k s are the saturation scales of the Fisher information, as 
shown in Fig. 2. We also list the values of the linear power spectra at these saturation scales for all four redshifts. 



We thus address the convergence as a function of the 
number of simulations with a Fisher information plot. 
We check the values of ^2 kk , <k C kk ,, where C is 
the covariance matrix of the normalized power spectrum 
P(k)/(P(k)), that is, the Fisher information is inversely 
proportional to the variance of the amplitude of normal- 
ized power spectrum. We have considered runs with dif- 
ferent resolutions and box-sizes for each redshift. The 
box-size needs to be big enough so that not much of the 
linear modes are cut-off by the limited box-size. On the 
other hand, the non-linear structure needs to be resolved 
at small enough scales so that the saturation effect can 
be seen. We confirmed the convergence at linear scales 
by comparing the results with the Gaussian prediction, 
and at non-linear scales by comparing with higher res- 
olution simulations. We found the upper limit for the 
non-linear power spectrum at the fundamental modes, 
which has the same size as the box, is around A 2 < 0.02; 
and the lower limit for non-linear power spectrum at the 
Nyquist frequency kpf v , is around A 2 > 20. The simu- 
lations parameters we finally chose are given in table I, 
and the power spectra are shown at Fig. 1. The val- 
ues of the non-linear power spectrum are estimated from 
CAMB, and thus can be obtained before running simula- 
tions. The actual power spectra from N-body simulations 
are always lower than that at fc/vj, because of the finite 
resolution. The sources at z — 1.25 are produced with 
simulations with 300/i _1 Mpc box, and 256 3 particles on 
512 3 grids. For z = 3, because the matter distribution is 
more linear, we need to increase the resolution to reach 
the non-linear structures at smaller scales. We did this 
by using a smaller box with L — 100/i _1 Mpc and more 
refined grids with n gr id = 1024. At z = 5, the non- 
linearity is lower, and we choose a 50ft. _1 Mpc box with 
1728 3 grids. For the sources at z = 3 and z ~ 5, we have 
assigned the particles distributions to 512 3 grids though 
the N-body simulations were performed on finer meshes 
for the sake of computation efficiency. We also include 
the Fisher information at z = 0, using the same simula- 
tion parameters as those at z = 1.25. 

The inversion of a large covariance matrix with a few 
hundred of elements on each dimension can be numeri- 
cally challenging. Following RH05, we divide the power 
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FIG. 2: The 3-D Fisher information content of the amplitude of 
power spectrum at z = 0, 1.25, 3.0, 5.0 respectively. All results are 
normalized to the values for a 300/i — 1 Mpc box. The dashed line 
is the Gaussian prediction. It is clear that the Fisher information 
curves grow as fc 3 at linear scales, then turn flat at quasi-linear 
scales (fc ~ 0.5 — 2/iMpc~ 1 for z = 1.25), before going up again at 
non-linear scales. As expected, there is a gradual evolution of the 
plateau from z = to z = 5. 



spectra to Nb = 20 bins uniformly distributed in log 
scale, and calculate the associated band power spectra 
and their covariance matrix. Subsequently, we invert the 
Nb x Nb matrix instead. The Fisher information can be 
seen in Fig. 2. All results are normalized to the values for 
a 300ft- _1 Mpc box, i.e., multiply the Fisher information 
measured by (300/i) 3 because larger volume has more 
independent modes. It is clear that the Fisher informa- 
tion curves grow as fc 3 at linear scales, then turn flat at 
quasi-linear scales (k ~ 0.5 — 2/iMpc~ 1 for z = 1.25), 
before going up again at non-linear scales with a quasi- 
Gaussian scaling. There is a gradual evolution of the 
plateau from z = to z = 5. 
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FIG. 3: Convergence test of the Fisher information as a function 
of simulations. We increase the number of simulations in the top 
panel. The relative difference from the 186 runs result is 2% for 
100 runs, and 10% for 50 runs. When the number of simulations is 
less than 30, the deviance is comparable to the Fisher information 
itself. The bottom panel is for different box-sizes, both types have 
186 runs. The error bars are generated by the bootstrap method. 
The Fisher information from simulations with L = 150/i," 1 Mpc 
agrees reasonably well with the one from 300h -1 Mpc simulations. 
As such, the lower limit of power spectrum at fundamental mode 
should be equal to or higher than the value found in the 150/i _1 
Mpc simulations. 



We show the convergence of the Fisher information as 
a function of both the number of simulations n s ; m and the 
resolution in Fig. 3. In the top panel, n s ; m are reduced 
from 186 to 100, 50, 30 and 20. The relative difference 
from the 186 runs result is 2% for 100 runs, and 10% for 
50 runs. When the number of simulations is less than 
30, the deviation is comparable to the Fisher informa- 
tion itself. We emphasize here that the required number 
of simulations also depends on the number of bins JV b : 
Nb has to be much smaller than the number of simula- 
tions for the covariance matrix to be non-singular. In 
the bottom panel, we show the convergence of two sets 
of simulations with different box-sizes. Both types have 
186 runs. The Fisher information from simulations with 
L = 150/i _1 Mpc agrees reasonably well with the one 
from 300/i _1 Mpc simulations. We thus deduce that the 
lower limit for the power spectrum at fundamental mode 
should be equal to or higher than the value found in the 
150/i _1 Mpc simulations. 



B. Lensing reconstruction 

We now study the lensing reconstruction noise that will 
be measured directly from reconstructed lensing maps 



with the optimal quadratic estimator described in section 
III. 



1. Power spectrum covariance matrix 

The power spectrum covariance matrix can be written 
as (P(fc)P(fe')) - (P(k)){P(k')). Because the source is 
isotropic and stationary, (P(fc)P(fc')} can be expressed 
in a symmetric way C(k,k' , cos 9k,k'), where cos9k,k' is 
the angle between the two vectors on the two shells de- 
noted by k and k' . We can then expand C in spherical 
harmonic functions, and because it is independent of 0, 
the expansion can be written as the sum of the Legendre 
functions 

oo 

C{k,k', cos k ,k') = ^Q(M') p /(cos0 fe ,fe')- (23) 

1=0 

Here I is a even integer because C(k,k' , cos 6k,k') is an 
even function of 9. The window function can be written 
as an integral of the form 

r c\ 3 k' 

^ (fe)cX J ^ C_1 (*' + GS(*')]> (24) 



where 



Gg(fc) = 2P + 2P'fc/3, 
G${k) = P'k(- cos 2 6 k . 



+ 1/3), 



(25) 
(26) 



which arc proportional to the zcroth order and second 
order Legendre function Pq (cos 9k' ) and P2 (cos 9y ) ■ For 
each fe, we can always choose the z-axis along it so that 
cos 9k,k' is equal to cos 9y without loss of generality. Be- 
cause the orthogonal property of the Legendre functions, 
only zeroth and second terms in C _1 remain, all the 
higher orders cancel out by the integral over k' . 

In other words, we decompose the optimal window 
function in two orthogonal components from the zeroth 
and second order of the covariance matrix expansion 
terms respectively. For Gaussian sources, the power spec- 
trum modes are uncorrelated with other modes in differ- 
ent directions or on different shells, therefore all C; are 
equal. For non-Gaussian sources, C2 is about an order 
of magnitude higher than Co, so the additional informa- 
tion of lensing obtained by using C2 is negligible (private 
communication with Joachim Harnois-Dcraps, Harnois- 
Deraps et al. [20]). 

In Fig. 4, we show the optimal non-Gaussian window, 
and the optimal Gaussian window function in Fourier 
space. The latter window has an almost power-law slope, 
while the former one has both positive and negative val- 
ues due to the complicated behavior of inverse covariance 
matrix C _1 . We use crosses to represent the absolute 
values of the negative part of the optimal non-Gaussian 
window. 

In this paper, we will reconstruct the estimator using 
the zeroth order only. Modes on a shell can be binned as 
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FIG. 4: The optimal non-Gaussian window, and the optimal Gaus- 
sian window function in Fourier space. The latter window has an 
almost power-law slope, while the former one has both positive and 
negative values due to the complicated behavior of inverse covari- 
ance matrix C _1 . We use crosses to represent the absolute values 
of the negative part of the optimal non-Gaussian window. 

a group first. Wc could simplify the estimator by calcu- 
lating its 1-D equivalent, e.g., replacing P(k) with P(k). 
Working in lower dimension facilitates the numerical cal- 
culation, especially the inversion of covariance matrix. 
Rather than working with a 6-D array C(fe, fe'), we now 
only need to calculate the covariance matrix 

C(fc, k') = (P(k)P(k')) (P(k))(P(k')) . (27) 

We treat the modes on each k shell as independent 
components and the window function corresponding to 
this shell is calculated from the overall contribution of 
these modes. We use another trick in the calculation of 
window function by replacing ^ C _1 (fc, k')G(k') with 

When considering the shear, the calculation is very 
similar. G 71 = P'k sin 2 9 cos 2(f) can also be written as 
the sum of Legendre functions: 

Gq 1 = 2P'k cos 20/3, (28) 
Gf = P'fc(l/3-cos 2 <9)cos2(/>. (29) 

Therefore only the zeroth and second order mode of co- 
variance remain, and we will also just use the zeroth order 
in the reconstruction of 71 . For 72, one just need to re- 
place cos 2(j) by sin 2(f). The convergence can be calculated 
from the reconstructed shear. We will call this conver- 
gence, 7b. Similar to LP08, we can choose the axis of 
the coordinate to be parallel to the direction of the mode 
measured, so that je =71. 



FIG. 5: The contribution from one box at z = 1.25 to the vari- 
ance of various estimators. The variances are averaged over 186 
independent boxes. The reconstruction noises for the Gaussian 
window, the optimal window for k, and the optimal window func- 
tion for 7b are presented. At linear scales, i.e., when £ < 1000, 
the two estimators have the same reconstruction noise. The non- 
Gaussian window has a factor of a few less noise than the Gaus- 
sian estimator, and displays a plateau above the saturation scale. 
The plateau means that the non-Gaussianity of sources caused by 
the effective number of cells increases slowly below the saturation 
scales, i.e., the structures are intrinsically correlated below such 
scales and external factor such as experimental noise will not have 
much effect. The Gaussian estimator used in LP08 does not de- 
crease at saturation scale, and even starts to increase at smaller 
scales. This is an artifact of the non-optimality of the Gaussian 
estimator for non-Gaussian sources. We also show the results from 
Gaussian sources as dashed (shear) and dotted lines (convergence). 
The slight non-monotonicity in the shear variance probably comes 
from the dropping of the quadrupolar C'2 term (see text). 

2. Convolution 

The optimal estimator contains the inverse covariance 
matrix C _1 , which we measure using around 100 — 200 
sources. For 512 3 grids, there are about 256 modes cov- 
ering the range from the box-size scale to the Nyquist 
scale. The inversion of the 256 x 256 covariance matrix 
will be unstable, because the number of eigenvalues is 
more than the total number of independent samples. As 
such, we will keep using the 20 bins in our calculation. 
The binned window function are less optimal, but the dif- 
ference is a minor disadvantage compared to the stability 
of the inversion. Another important term in the window 
function is the gradient of the power spectra. Since the 
power spectrum is close to a power-law, we calculate the 
gradient by finding the tangent of the power spectrum 
at saturation scale k s measured from the Fisher infor- 
mation on the log-log plot. We find P'k = ti^lP, and 
«nl = —1.6, —1.9 and —2.1 for z = 1.25,3 and 5 respec- 
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tivcly. 

On the 512 3 grids, we calculate the window function 
with the band power, inverse matrix, and gradient. The 
Nearest Grid Point (NGP) method is used to map these 
band values to the 3-D grids in Fourier space. From 
Eq. (20), we know that the optimal window function docs 
not only depend on the amplitude k, but is also a function 
of k± . Note that when considering the shear, there is an 
extra angular dependence on 0k ± , the direction on the 
transverse plane that we need to take into account when 
computing the window function. 

Finally, the 3-D sources are convolved with the opti- 
mal window function. Since the window functions arc 
generated in Fourier space, we can simply transform 
the temperature fields to Fourier space by Fast Fourier 
Transform (FFT), multiply by the window functions, and 
transform them back to real space, k or je maps are inte- 
grals of the covariance maps of the two convolved fields 
along the line of sight. Since we are interested in the 
zero mode, which is the average value of the covariance, 
we calculate the reconstruction noise of the combined es- 
timator of k and je (as in LP08), which decrease the 
noise level. To validate the procedure, we compare the 
reconstruction noises with the analytical predictions in 
the Gaussian simulations with the same power spectra 
as the simulated sources. 



V. NUMERICAL RESULTS AND DISCUSSION 
A. Reconstruction noise 

The optimal estimators were derived in the constant 
k(j) limit (zero-mode). For the other scales, we need to 
scale the estimator by a normalization factor b{£). The 
derivation of b{£) can be found in appendix A. Not sur- 
prisingly, b{£) < 1, and decreases from 1 to when I 
increases, for any given estimator. For the scales we are 
interested in - above the characteristic scale £ a - b(£) is 
close to 1, and the noise properties are similar to the zero- 
mode one. To illustrate the comparison between different 
estimators at various £ with the of zero-mode reconstruc- 
tion, we treat the experimental noise as a hard cut-off 
at k c . l a ~ £ c /2 = x( z s)k c /2. This is motivated by the 
fact that the reconstructed noise is proportional to fc~ 3 
on linear scales, and the contribution to the reconstruc- 
tion at k < k c /2 are small and can be treated like the 
zero-mode. The values of l a are shown in Fig. 8 and 9. 

In Fig. 5, the zero-mode k reconstruction noises from 
various estimators are presented as functions of various 
cut-off scales at z = 1.25. For this plot, the noises cor- 
respond to the contribution from a single simulation box 
of width 300/i _1 Mpc, and is measured over 186 indepen- 
dent boxes. As a reference, we present the reconstruction 
noises from the Gaussian window, non-Gaussian window 
for k and the non-Gaussian window for te. On linear 
scales, i.e., when I < 1000, the two estimators of k 
have the same reconstruction noise. At smaller scales, 



the non-Gaussian window reduces the noise by factor 
of a few compared to the Gaussian estimator, and has 
a plateau after the saturation scale. The experimental 
noise smears out the structure in the sources, and regions 
which arc originally independent become correlated. In 
other words, lower noise level leads to a larger effective 
number of independent source cells. Similarly, the ef- 
fective number of cells increases if the experiment is re- 
solved at smaller scales, therefore the reconstructed noise 
is lower. The plateau means that the non-Gaussianity of 
sources cause the effective number of cells to increase very 
slowly below the saturation scale, i.e., the structures are 
intrinsically correlated below this scale and external fac- 
tors such as experimental noise will have little effect. The 
Gaussian estimator we used in LP08 also saturates at sat- 
uration scales, and even increases at smaller scales. This 
is an artifact effect coming from the fact that the Gaus- 
sian estimator is non-optimal for non-Gaussian sources. 
Note also that shear has better S/N level than k. To com- 
pare with the simulated sources, we also show the results 
from Gaussian predictions calculated using Wick's theo- 
rem. In fact, the reconstruction noises in the Gaussian 
case can be approximated by the following power-law re- 
lationship: 

Var(K<f auss ) « 3^ 2 L- 3 fc- 3 (l + ^)- 2 , 
Var( 7 g auss ) « 45^Zr 3 fc- 3 n N 2 . (30) 

Note that iinl can not be -3 or 0, otherwise the vari- 
ance of the estimator is infinity and the lensing signal 
can not be reconstructed. An intuitive explanation is 
that, in these cases the variance of temperature field are 
conserved even after being lensed, therefore lensing maps 
can not be distinguished from unlcnscd ones and lensing 
can not be extracted. This stems from the fact that we 
only consider the zero order, Co, in the covariance matrix 
of matter power spectrum. Lensing can still be solved for 
iinl = —3, cases if C2 is taken into account in the esti- 
mator. 



B. Saturation effects 

We define the saturation scale k s as the wavenumbcr 
where the noise from the Gaussian prediction is equal 
to the average amplitude of reconstructed noise in the 
plateau. We have discussed the saturation effects of lens- 
ing reconstruction in LP08, where we found that at z = 7 
the S/N saturates at the equivalent scale k s where the 
non-linear power spectrum of source A 2 (fc,s) ~ 0.2. The 
corresponding linear power spectrum at k s is about 0.17, 
which is consistent with the results at other redshifts 
shown in table II. It is not clear why the value of Af in (k s ) 
is slightly higher at z=7. It could come from either the 
fact that there were less sources (n sim — 20) or that the 
estimator was less optimal in the works of LP08. Note 
that since we used a Gaussian estimator in LP08 and no 
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FIG. 6: Power spectra at the saturation scales of the lensing recon- 
struction noises for different redshifts. It can be seen that A?. (k s ) 
increases with steeper shape of power spectrum. The points mea- 
sured from the k saturation almost overlap with those measured 
from the Fisher information saturation because the shapes of the 
source power spectra are very close to power-laws. 



FIG. 7: The effective number densities of sources at various red- 
shifts. When the redshift is higher, the non-linearity of sources 
goes down. dn g /dz/a^, increases quickly with redshift. We also 

it 



plot the number density for a toy model where A?. (fc s ) = 0.2 for 



all redshifts. 



C _1 term was involved, the convergence of the recon- 
structed noise should be much better than that for the 
Fisher information shown in Fig. 3. Since the shape of 
the matter power spectrum persists except the non-linear 
scales shift to larger scales, we expect to see similar sat- 
uration effects at lower redshifts. This is confirmed by 
Fig. 5 and later by Fig. 6. 

As we mentioned earlier, the saturation effect resem- 
bles the Fisher information saturation effect for the ini- 
tial amplitude of the 3-D dark matter power spectrum 
in RH05. Their cumulative Fisher information increases 
at linear scales. On trans-linear scales, the Fisher in- 
formation is degenerate with that from larger scales and 
the cumulative Fisher information does not increase. It 
is possible to view their Fisher information as a special 
case of our lensing reconstruction calculation if the gra- 
dient term G equals the derivative of the power spectrum 
with regard to an overall amplitude. Neyrinck et al. [21] 
have used halo models to explain this saturation effect: 
on linear scales, the cumulative Fisher information in- 
creases with higher £, as the volume and number of halos 
do. On trans-linear scales, the 2-halo term first domi- 
nates, which washes out the fluctuation in 1-halo term. 
On non-linear scales, the contribution from small mass 
halos dominates, and the Fisher information increases 
again — at less than 1% of the Gaussian information. 
Our numerical results confirmed the reports of RH05. 

The saturation scales, as well as the non-linear scales, 
change with redshifts. To illustrate this we investigate 
the evolution of Af in at the saturation scale. In Fig. 6, 



we plot Af in (k s ) at three redshifts and we give the cor- 
responding numbers in table II. Because the optimal 
estimator is a function of the gradient of the power spec- 
trum, we will try to see the evolution of Af in (k s ) with the 
gradient of power spectrum at the saturation scale. It 
can be seen that Af in (k s ) increases with steeper shape of 
power spectrum. The points from k almost overlap with 
those from the Fisher information, because the shapes of 
source power spectra arc very close to power-laws. 



C. Effective number density 



To describe the information content gained in the 21- 
cm lensing in a more intuitive way, we define n g , the 
equivalent effective surface number density of galaxies 
which gives the same noise level at the scale where S/N 
equals to one: 



Var(/t) = 



n g ■ Area N ceU ' 



(31) 



Similarly, we can define the number of effective indepen- 
dent cells, N ce u, and an effective number density for the 
noise of je- The characteristic scale of the independent 
cell is approximately the saturation scale k s . The cell 
size is 



L cell = 24 s x[27(n NL + 3)-78f] 1 / 3 , 
L ce a = 2i:/k s x [45n^/87r] 1/3 for j E . 



(32) 
(33) 
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redshift 


estimator 


dng/dz/a^ (arcmin 2 ) 


LceaO^Mpc) 


Is 


k s 




1.25 


K 


0.16 


19. 


773 


0.28 


0.34 


1.25 


"JE 


0.13 


20. 


773 


0.28 


0.34 


3.0 


K 


0.57 


13. 


2136 


0.46 


0.19 


3.0 


7B 


0.48 


14. 


1671 


0.36 


0.15 


5.0 


K 


3.4 


7.0 


5657 


0.99 


0.16 


5.0 


JE 


5.6 


5.9 


4514 


0.79 


0.13 



TABLE II: Equivalent number densities of galaxies and saturation scales at different rcdshifts. The characteristic scale of the independent 
cell L ce u is approximately the saturation scale k s . The number density n g is calculated with Eq. (31). Because n g itself depends on the 
box-size of the simulated source and the variance of intrinsic ellipticities a^, we compare dn g /dz/a^ instead, which increases rapidly with 
redshift. 
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FIG. 8: The noise power spectra from the effective number densi- 
ties of galaxies, compared with the lensing signals. The solid lines 
are the lensing power spectra. The error bars are the contribution 
from both the lensing reconstruction noises and the cosmic vari- 
ance. The noise dominates over the signal for sources at z ~ 1 — 1.5, 
becomes comparable to but less than signal in z ~ 2 — 4, and fur- 
ther decreases at z ~ 4 — 6. This corresponds to a sensitivity to 
lens structures around z = 0.5, 1.0 and 1.5 respectively. Note that 
the noise we display here is only valid for t < t a , since we calculate 
n g in the regime where the noise has similar behavior to zero mode. 
l a are plotted as vertical lines here and in Fig. 9. 



where l s — k s x(z) is the saturation scale that increases 
with redshift. n g at different redshifts are shown in table 
II. Because n g itself depends on the box-size of the simu- 
lated source and the variance of the intrinsic ellipticities 
<7g, we compute dn g /dz/a1 instead. From the table, we 
see that the effective number density of galaxies increases 
rapidly with redshift. We also plot dn g /dz/a1 versus z 
in Fig. 7, where one can see that dn g /dz/a1 increases 
quickly with redshift. We also show the number density 
for a toy model where Af in (k s ) — 0.2 for all redshifts. 



FIG. 9: The ratio of the error bars and lensing signals. The vertical 
lines are l a , and the noise model is valid for I < i a . For i 2> 
£ a , a scaling factor b(£) need to be applied. The calculation for 
b(£) is done in LP08, and we found it decreased from 1 gradually. 
Therefore the noises at these small scales will be higher than what 
are shown in this plot. However, these scales are not of interest to 
us in this paper. The error bars for 2 — 4 bin arc at a few percents 
level for £ ~ 20 - 500. 

In Fig. 8, we compare the shot noises t 2 Cf /2tt from 
the effective number densities with the lensing signals. 
The error bars of the lensing power spectra are also plot- 
ted, which are estimated by 

^-vWS iCT+ ^'' (34) 

where A£ w 1/2 because we used I — 2" (n — 1,2, ...) 
bins, and consider a half sky survey [5] with the fraction 
of the sky f sky = 1/2. Here we assume the noise of k 
is Gaussian, i.e., the eight-point function of temperature 
is Gaussian though the temperature distribution itself is 
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FIG. 10: The lensing power spectra from different models of dark 
energy (private communication with Fabian Schmidt). The average 
rcdshift is 3.57, where the effective number density of sources is 
calculated by interpolation with the values from Fig. 7. The error 
bars are calculated in the same way as Fig. 8. 



non-Gaussian. Although not obvious, this point is sup- 
ported by numerically tests. We also neglect the non- 
Gaussianity of the lensing observables themselves, which 
is not dominant at I < 500 but could become an issue at 
higher I [22]. It can be seen that the noise dominates in 
z ~ 1 — 1.5, becomes comparable to signal in z ~ 2 — 4, 
and further decreases at z ~ 4 — 6. For rcdshift bin 
z ~ 4 — 6, we assume that dn g /dz varies slowly, and use 
rig = dn g /dz\ z= ^lS.z with Az = 2, which should provide 
the correct order of magnitude. Ideally, simulated sources 
should be generated at all redshifts between z — 4 and 
z — 6, and the total number density is then the integral 
of dtig/dz over all z. The noise is calculated similarly for 
1 — 1.25 and 2 — 4 source redshift bins. 

The noise is only valid for I < £ a ~ £ s , because we 
calculate n g in the regime where the noise has a similar 
behavior to the zero mode. We plot i a as vertical lines 
in Fig. 9. The full noise calculation can be done simi- 
larly to LP08, however the zero- mode k calculation works 
well enough because we are only interested in large scales 
(small £'s). The noise at small scales will be due to the 
increased non-Gaussianity at lower redshifts. For I > £ a , 
b(£) is smaller than 1, and the noise will be higher. 

The relative error bars are shown in Fig. 9. The er- 
ror bars for z ~ 2 — 4 bin are at a few percents level 
for € ~ 20 — 500. Schmidt [23] pointed out that linear 
scales detection of lensing for source galaxies at z ~ 1 — 3 
could be a good way to distinguish three modified grav- 
ity models from a smooth dark energy: f(R) gravity, the 
DGP model, and the TeVeS theory. We show the lens- 



ing power spectrum from different models of dark energy 
(private communication with Fabian Schmidt) in Fig. 10. 
The average redshift is 3.57, where the effective number 
density of sources is calculated by interpolation with the 
values from Fig. 7. The error bars are calculated in the 
same way as Fig. 8. Because of the high precision of the 
lensing reconstruction, we can expect to see a promising 
use of 21-cm sources to constrain modified gravity. 

Whereas we discussed here how to use 21-cm lensing to 
constrain dark energy from measuring the lensing power 
spectrum. One can also investigate a parametrized dark 
energy model by constraining wo,w a using lensing to- 
mography [22]. Another test of gravity and probe of 
dark-energy comes through the Integrated Sachs- Wolfe 
(ISW) effect [24]. In Fig. 11 we illustrate the potential 
for cross-correlating the ISW signal using the lensing of 
sources with 2.5 < z < 3. As an illustration, wc also plot 
the CMB lensing kernel. If the Ci of any of this signal 
is written as C< = / daWe(a), we here plot We(a) as a 
function of a to illustrate the sensitivity of our various 
probes. Whereas we see that the CMB lensing kernel 
overlaps widely with the ISW signal, the 21-cm lensing 
signal is a very good match to the ISW kernel. To quan- 
tify this, if we define as the cross-correlation coefficient 
between two fields X and Y as 

•f^E^/^/W- (35) 

i 

we find that the cross-correlation coefficient at t = 
20 (40) for the CMB Lensing - ISW and 21-cm lensing 
- ISW are respectively 0.98 (0.99) and 0.95 (0.96). As 
such, whereas neither are perfect, the 21-cm - ISW sig- 
nal is almost as good as the CMB - lensing. Using various 
rcdshift bins would help us increasing this signal [25] . By 
combining the different lensing sources planes, one can 
contract an optimal ISW estimator. This is the subject 
of a paper in progress. 

VI. CONCLUSION 

In this paper, we discussed the possibility to constrain 
dark energy models with the lensing from 21-cm inten- 
sity emission at redshifts z <~ 1 — 6. First we derived the 
optimal quadratic estimator for non-Gaussian sources, 
which can be constructed numerically from simulations. 
Then we investigated the reconstruction noise with a 
large number of simulations, and revealed that there is 
a saturation scale for the reconstruction noise at all red- 
shifts. We calculated the effective number densities of 21- 
cm sources and compared the corresponding shot noises 
to the lensing signals. We conclude that 21-cm sources 
are promising lensing sources, because they can be ob- 
served at high rcdshift, and may soon be mapped over 
half the sky [5]. The lensing reconstructed from 21-cm 
sources at z ~ 3, has a few percent error bars on linear 
scales at z ~ 1. These wide area dark matter maps are 
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well suited to test dark energy dynamics through cross 
correlation with the ISW effect, and can be used to con- 
strain modified gravity models. 
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FIG. 11: We plot the kernel for I = 20 as defined in the text 
for the ISW, 21-cm lensing (2.5 < z < 3) and the CMB lensing 
signals as a function of the scale factor. Whereas the CMB lensing 
overlapses with most of the ISW signal due to its width, we also 
see that the 21-cm lensing at high z has a very good overlapping 
with the ISW kernel which suggests that the 21-cm lensing from 
high z constitutes a good probe of the ISW effect. 
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